Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 225493 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 63.0 MiB |
| Average record size in memory | 293.0 B |
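As a quick consistency check, the average record size should be roughly the total memory divided by the number of observations. The sketch below redoes that arithmetic from the rounded figures in the table; the helper name `average_record_size` is illustrative and not part of pandas-profiling.

```python
MIB = 1024 ** 2  # bytes per mebibyte


def average_record_size(total_mib: float, n_rows: int) -> float:
    """Return the mean number of bytes per row for a frame of the given size."""
    return total_mib * MIB / n_rows


# Figures copied from the "Dataset statistics" table.
size_bytes = average_record_size(63.0, 225493)
print(f"{size_bytes:.1f} B per record")
```

The result agrees with the reported 293.0 B to within the rounding of the 63.0 MiB input.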
Variable types
| Numeric | 29 |
|---|---|
| DateTime | 2 |
| Categorical | 5 |
| Boolean | 5 |
Alerts
| Alert | Type |
|---|---|
| AVERAGE.ACCT.AGE has a high cardinality: 192 distinct values | High cardinality |
| CREDIT.HISTORY.LENGTH has a high cardinality: 291 distinct values | High cardinality |
| df_index is highly correlated with Employee_code_ID | High correlation |
| disbursed_amount is highly correlated with asset_cost | High correlation |
| asset_cost is highly correlated with disbursed_amount | High correlation |
| State_ID is highly correlated with branch_id and 3 other fields | High correlation |
| Employee_code_ID is highly correlated with df_index | High correlation |
| PERFORM_CNS.SCORE is highly correlated with PERFORM_CNS.SCORE.DESCRIPTION | High correlation |
| PRI.NO.OF.ACCTS is highly correlated with PRI.ACTIVE.ACCTS and 1 other field | High correlation |
| PRI.ACTIVE.ACCTS is highly correlated with PERFORM_CNS.SCORE.DESCRIPTION and 5 other fields | High correlation |
| PRI.CURRENT.BALANCE is highly correlated with PRI.ACTIVE.ACCTS and 3 other fields | High correlation |
| PRI.SANCTIONED.AMOUNT is highly correlated with PRI.ACTIVE.ACCTS and 3 other fields | High correlation |
| PRI.DISBURSED.AMOUNT is highly correlated with PRI.ACTIVE.ACCTS and 3 other fields | High correlation |
| SEC.NO.OF.ACCTS is highly correlated with SEC.ACTIVE.ACCTS and 1 other field | High correlation |
| SEC.ACTIVE.ACCTS is highly correlated with SEC.NO.OF.ACCTS and 4 other fields | High correlation |
| SEC.CURRENT.BALANCE is highly correlated with SEC.ACTIVE.ACCTS and 2 other fields | High correlation |
| SEC.SANCTIONED.AMOUNT is highly correlated with SEC.ACTIVE.ACCTS and 2 other fields | High correlation |
| SEC.DISBURSED.AMOUNT is highly correlated with SEC.ACTIVE.ACCTS and 2 other fields | High correlation |
| PRIMARY.INSTAL.AMT is highly correlated with PRI.NO.OF.ACCTS and 4 other fields | High correlation |
| SEC.INSTAL.AMT is highly correlated with SEC.NO.OF.ACCTS and 4 other fields | High correlation |
| NEW.ACCTS.IN.LAST.SIX.MONTHS is highly correlated with PRI.NO.OF.ACCTS and 4 other fields | High correlation |
| Aadhar_flag is highly correlated with State_ID and 1 other field | High correlation |
| VoterID_flag is highly correlated with State_ID and 1 other field | High correlation |
| PERFORM_CNS.SCORE.DESCRIPTION is highly correlated with PERFORM_CNS.SCORE and 1 other field | High correlation |
| SEC.OVERDUE.ACCTS is highly correlated with SEC.NO.OF.ACCTS and 1 other field | High correlation |
| UniqueID is highly correlated with DisbursalDate | High correlation |
| branch_id is highly correlated with Current_pincode_ID and 1 other field | High correlation |
| Current_pincode_ID is highly correlated with branch_id and 1 other field | High correlation |
| DisbursalDate is highly correlated with UniqueID | High correlation |
| PRI.CURRENT.BALANCE is highly skewed (γ1 = 29.25624693) | Skewed |
| PRI.SANCTIONED.AMOUNT is highly skewed (γ1 = 319.533663) | Skewed |
| PRI.DISBURSED.AMOUNT is highly skewed (γ1 = 318.4004683) | Skewed |
| SEC.NO.OF.ACCTS is highly skewed (γ1 = 27.84235193) | Skewed |
| SEC.ACTIVE.ACCTS is highly skewed (γ1 = 30.4096604) | Skewed |
| SEC.OVERDUE.ACCTS is highly skewed (γ1 = 24.01431522) | Skewed |
| SEC.CURRENT.BALANCE is highly skewed (γ1 = 107.0091863) | Skewed |
| SEC.SANCTIONED.AMOUNT is highly skewed (γ1 = 74.21689332) | Skewed |
| SEC.DISBURSED.AMOUNT is highly skewed (γ1 = 74.71985798) | Skewed |
| PRIMARY.INSTAL.AMT is highly skewed (γ1 = 71.5253121) | Skewed |
| SEC.INSTAL.AMT is highly skewed (γ1 = 152.8457066) | Skewed |
| df_index is uniformly distributed | Uniform |
| df_index has unique values | Unique |
| UniqueID has unique values | Unique |
| PERFORM_CNS.SCORE has 111773 (49.6%) zeros | Zeros |
| PRI.NO.OF.ACCTS has 111773 (49.6%) zeros | Zeros |
| PRI.ACTIVE.ACCTS has 131395 (58.3%) zeros | Zeros |
| PRI.OVERDUE.ACCTS has 199703 (88.6%) zeros | Zeros |
| PRI.CURRENT.BALANCE has 136011 (60.3%) zeros | Zeros |
| PRI.SANCTIONED.AMOUNT has 132449 (58.7%) zeros | Zeros |
| PRI.DISBURSED.AMOUNT has 132559 (58.8%) zeros | Zeros |
| SEC.NO.OF.ACCTS has 219731 (97.4%) zeros | Zeros |
| SEC.ACTIVE.ACCTS has 221737 (98.3%) zeros | Zeros |
| SEC.OVERDUE.ACCTS has 224183 (99.4%) zeros | Zeros |
| SEC.CURRENT.BALANCE has 222182 (98.5%) zeros | Zeros |
| SEC.SANCTIONED.AMOUNT has 221816 (98.4%) zeros | Zeros |
| SEC.DISBURSED.AMOUNT has 221846 (98.4%) zeros | Zeros |
| PRIMARY.INSTAL.AMT has 153544 (68.1%) zeros | Zeros |
| SEC.INSTAL.AMT has 223313 (99.0%) zeros | Zeros |
| NEW.ACCTS.IN.LAST.SIX.MONTHS has 174944 (77.6%) zeros | Zeros |
| DELINQUENT.ACCTS.IN.LAST.SIX.MONTHS has 207647 (92.1%) zeros | Zeros |
| NO.OF_INQUIRIES has 194990 (86.5%) zeros | Zeros |
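The Skewed and Zeros alerts can be reproduced on a single column with a few lines of plain Python. The sketch below assumes the report's γ1 is the bias-adjusted sample skewness that pandas computes by default (an assumption; the report does not say which estimator is used). The function names and the toy data are illustrative only.

```python
import math
from typing import Sequence


def sample_skewness(values: Sequence[float]) -> float:
    """Adjusted Fisher-Pearson skewness G1 (needs at least 3 values)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # 2nd central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # 3rd central moment
    if m2 == 0:
        return 0.0  # constant column: skewness is undefined, report 0
    g1 = m3 / m2 ** 1.5
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


def zero_share(values: Sequence[float]) -> float:
    """Fraction of entries that are exactly zero."""
    return sum(1 for v in values if v == 0) / len(values)


# Toy column: mostly zeros with a few large balances, like the SEC.* fields.
toy = [0] * 97 + [1_000, 5_000, 250_000]
print(f"zeros: {zero_share(toy):.1%}, skewness: {sample_skewness(toy):.2f}")
```

A column dominated by zeros with a handful of large values produces both alerts at once, which is the pattern the secondary-account fields show here.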
Reproduction
| Analysis started | 2022-11-01 12:15:50.320040 |
|---|---|
| Analysis finished | 2022-11-01 12:24:31.730569 |
| Duration | 8 minutes and 41.41 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
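The reported duration follows directly from the two timestamps. A minimal check with the standard library (variable names are illustrative):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" table.
started = datetime.strptime("2022-11-01 12:15:50.320040", FMT)
finished = datetime.strptime("2022-11-01 12:24:31.730569", FMT)

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```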
df_index
Real number (ℝ≥0)
| Distinct | 225493 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116840.0736 |
| Minimum | 0 |
| Maximum | 233153 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11860.6 |
| Q1 | 58759 |
| median | 116929 |
| Q3 | 175107 |
| 95-th percentile | 221622.4 |
| Maximum | 233153 |
| Range | 233153 |
| Interquartile range (IQR) | 116348 |
Descriptive statistics
| Standard deviation | 67261.70244 |
|---|---|
| Coefficient of variation (CV) | 0.5756732289 |
| Kurtosis | -1.196648141 |
| Mean | 116840.0736 |
| Median Absolute Deviation (MAD) | 58174 |
| Skewness | -0.004018369466 |
| Sum | 2.634661872 × 10^10 |
| Variance | 4524136616 |
| Monotonicity | Strictly increasing |
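Several of the derived statistics above are simple functions of the others, which makes them easy to sanity-check. The sketch below recomputes the range, IQR, coefficient of variation, and variance from the values reported for this variable; the dictionary keys are just labels chosen for the example.

```python
import math

# Values copied from the quantile and descriptive statistics tables.
reported = {
    "min": 0, "max": 233153,
    "q1": 58759, "q3": 175107,
    "mean": 116840.0736, "std": 67261.70244,
    "range": 233153, "iqr": 116348,
    "cv": 0.5756732289, "variance": 4524136616,
}

derived = {
    "range": reported["max"] - reported["min"],
    "iqr": reported["q3"] - reported["q1"],
    "cv": reported["std"] / reported["mean"],  # CV = std / mean
    "variance": reported["std"] ** 2,          # variance = std squared
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name:>8}: derived={value:.6g} reported={reported[name]:.6g} match={ok}")
```

The tolerance is loose on purpose, because the inputs are themselves rounded to about ten significant digits.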
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 155617 | 1 | < 0.1% |
| 155606 | 1 | < 0.1% |
| 155607 | 1 | < 0.1% |
| 155608 | 1 | < 0.1% |
| 155609 | 1 | < 0.1% |
| 155610 | 1 | < 0.1% |
| 155611 | 1 | < 0.1% |
| 155612 | 1 | < 0.1% |
| 155613 | 1 | < 0.1% |
| Other values (225483) | 225483 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 233153 | 1 | |
| 233152 | 1 | |
| 233151 | 1 | |
| 233150 | 1 | |
| 233149 | 1 | |
| 233148 | 1 | |
| 233147 | 1 | |
| 233146 | 1 | |
| 233145 | 1 | |
| 233144 | 1 |
UniqueID
Real number (ℝ≥0)
| Distinct | 225493 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 535677.4538 |
| Minimum | 417428 |
| Maximum | 671084 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 417428 |
|---|---|
| 5-th percentile | 429179.6 |
| Q1 | 476481 |
| median | 535593 |
| Q3 | 594774 |
| 95-th percentile | 642328.4 |
| Maximum | 671084 |
| Range | 253656 |
| Interquartile range (IQR) | 118293 |
Descriptive statistics
| Standard deviation | 68337.22275 |
|---|---|
| Coefficient of variation (CV) | 0.1275715867 |
| Kurtosis | -1.198808756 |
| Mean | 535677.4538 |
| Median Absolute Deviation (MAD) | 59148 |
| Skewness | 0.001876839031 |
| Sum | 1.207915161 × 10^11 |
| Variance | 4669976013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 420825 | 1 | < 0.1% |
| 492534 | 1 | < 0.1% |
| 585417 | 1 | < 0.1% |
| 571939 | 1 | < 0.1% |
| 591840 | 1 | < 0.1% |
| 631630 | 1 | < 0.1% |
| 421233 | 1 | < 0.1% |
| 585875 | 1 | < 0.1% |
| 490346 | 1 | < 0.1% |
| 644368 | 1 | < 0.1% |
| Other values (225483) | 225483 |
| Value | Count | Frequency (%) |
| 417428 | 1 | |
| 417429 | 1 | |
| 417430 | 1 | |
| 417431 | 1 | |
| 417432 | 1 | |
| 417433 | 1 | |
| 417434 | 1 | |
| 417435 | 1 | |
| 417436 | 1 | |
| 417437 | 1 |
| Value | Count | Frequency (%) |
| 671084 | 1 | |
| 671033 | 1 | |
| 658676 | 1 | |
| 658675 | 1 | |
| 658674 | 1 | |
| 658673 | 1 | |
| 658672 | 1 | |
| 658671 | 1 | |
| 658670 | 1 | |
| 658669 | 1 |
disbursed_amount
Real number (ℝ≥0)
| Distinct | 24228 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54240.72883 |
| Minimum | 13320 |
| Maximum | 987354 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 13320 |
|---|---|
| 5-th percentile | 34839 |
| Q1 | 47049 |
| median | 53703 |
| Q3 | 60213 |
| 95-th percentile | 73817 |
| Maximum | 987354 |
| Range | 974034 |
| Interquartile range (IQR) | 13164 |
Descriptive statistics
| Standard deviation | 12775.59006 |
|---|---|
| Coefficient of variation (CV) | 0.2355349999 |
| Kurtosis | 146.7750226 |
| Mean | 54240.72883 |
| Median Absolute Deviation (MAD) | 6558 |
| Skewness | 3.069299613 |
| Sum | 1.223090467 × 10^10 |
| Variance | 163215701.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48349 | 2076 | 0.9% |
| 53303 | 2034 | 0.9% |
| 50303 | 1915 | 0.8% |
| 51303 | 1913 | 0.8% |
| 52303 | 1814 | 0.8% |
| 47349 | 1814 | 0.8% |
| 55259 | 1795 | 0.8% |
| 46349 | 1569 | 0.7% |
| 56259 | 1534 | 0.7% |
| 57259 | 1521 | 0.7% |
| Other values (24218) | 207508 |
| Value | Count | Frequency (%) |
| 13320 | 1 | < 0.1% |
| 13369 | 1 | < 0.1% |
| 13600 | 1 | < 0.1% |
| 13640 | 1 | < 0.1% |
| 13652 | 1 | < 0.1% |
| 13664 | 6 | |
| 13814 | 1 | < 0.1% |
| 13914 | 1 | < 0.1% |
| 13940 | 1 | < 0.1% |
| 13941 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 987354 | 1 | |
| 592460 | 1 | |
| 332045 | 1 | |
| 318533 | 1 | |
| 315904 | 1 | |
| 237779 | 1 | |
| 196998 | 1 | |
| 191392 | 1 | |
| 190887 | 1 | |
| 187787 | 1 |
asset_cost
Real number (ℝ≥0)
| Distinct | 45415 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75631.13188 |
| Minimum | 37000 |
| Maximum | 1328954 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 37000 |
|---|---|
| 5-th percentile | 58170 |
| Q1 | 65625 |
| median | 70807 |
| Q3 | 78966 |
| 95-th percentile | 109031.8 |
| Maximum | 1328954 |
| Range | 1291954 |
| Interquartile range (IQR) | 13341 |
Descriptive statistics
| Standard deviation | 18527.57573 |
|---|---|
| Coefficient of variation (CV) | 0.2449728738 |
| Kurtosis | 110.3653432 |
| Mean | 75631.13188 |
| Median Absolute Deviation (MAD) | 6157 |
| Skewness | 4.055663725 |
| Sum | 1.705429082 × 10^10 |
| Variance | 343271062.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 68000 | 674 | 0.3% |
| 67000 | 583 | 0.3% |
| 72000 | 526 | 0.2% |
| 70000 | 491 | 0.2% |
| 74000 | 461 | 0.2% |
| 66000 | 457 | 0.2% |
| 75000 | 453 | 0.2% |
| 73000 | 450 | 0.2% |
| 69000 | 430 | 0.2% |
| 65000 | 392 | 0.2% |
| Other values (45405) | 220576 |
| Value | Count | Frequency (%) |
| 37000 | 2 | |
| 37129 | 1 | |
| 37230 | 1 | |
| 37310 | 1 | |
| 37377 | 1 | |
| 37658 | 1 | |
| 37816 | 1 | |
| 38055 | 2 | |
| 38059 | 1 | |
| 38063 | 1 |
| Value | Count | Frequency (%) |
| 1328954 | 1 | |
| 715186 | 1 | |
| 459625 | 1 | |
| 388025 | 1 | |
| 383600 | 1 | |
| 378092 | 2 | |
| 286350 | 1 | |
| 281164 | 1 | |
| 280100 | 1 | |
| 277600 | 1 |
ltv
Real number (ℝ≥0)
| Distinct | 6541 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.80663386 |
| Minimum | 13.5 |
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 13.5 |
|---|---|
| 5-th percentile | 52.42 |
| Q1 | 68.96 |
| median | 76.89 |
| Q3 | 83.73 |
| 95-th percentile | 89.39 |
| Maximum | 95 |
| Range | 81.5 |
| Interquartile range (IQR) | 14.77 |
Descriptive statistics
| Standard deviation | 11.4418905 |
|---|---|
| Coefficient of variation (CV) | 0.1529528854 |
| Kurtosis | 1.293300679 |
| Mean | 74.80663386 |
| Median Absolute Deviation (MAD) | 7.25 |
| Skewness | -1.076667482 |
| Sum | 16868372.29 |
| Variance | 130.9168582 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85 | 4298 | 1.9% |
| 84.99 | 1018 | 0.5% |
| 79.99 | 536 | 0.2% |
| 80 | 480 | 0.2% |
| 75 | 415 | 0.2% |
| 79.9 | 402 | 0.2% |
| 79.79 | 387 | 0.2% |
| 74.93 | 374 | 0.2% |
| 90 | 328 | 0.1% |
| 89.86 | 327 | 0.1% |
| Other values (6531) | 216928 |
| Value | Count | Frequency (%) |
| 13.5 | 1 | |
| 14.17 | 1 | |
| 15.3 | 1 | |
| 15.58 | 1 | |
| 16.6 | 1 | |
| 17.02 | 1 | |
| 17.05 | 1 | |
| 17.13 | 1 | |
| 17.36 | 1 | |
| 18 | 1 |
| Value | Count | Frequency (%) |
| 95 | 8 | < 0.1% |
| 94.99 | 7 | < 0.1% |
| 94.98 | 9 | |
| 94.97 | 5 | < 0.1% |
| 94.96 | 11 | |
| 94.95 | 14 | |
| 94.94 | 13 | |
| 94.93 | 20 | |
| 94.92 | 17 | |
| 94.91 | 13 |
branch_id
Real number (ℝ≥0)
| Distinct | 82 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.07061417 |
| Minimum | 1 |
| Maximum | 261 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 14 |
| median | 61 |
| Q3 | 130 |
| 95-th percentile | 249 |
| Maximum | 261 |
| Range | 260 |
| Interquartile range (IQR) | 116 |
Descriptive statistics
| Standard deviation | 70.01414708 |
|---|---|
| Coefficient of variation (CV) | 0.9581710497 |
| Kurtosis | 0.302701006 |
| Mean | 73.07061417 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | 1.032784512 |
| Sum | 16476912 |
| Variance | 4901.980791 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 13003 | 5.8% |
| 67 | 10858 | 4.8% |
| 3 | 9214 | 4.1% |
| 5 | 9096 | 4.0% |
| 36 | 8818 | 3.9% |
| 34 | 7794 | 3.5% |
| 136 | 7128 | 3.2% |
| 19 | 5843 | 2.6% |
| 16 | 5592 | 2.5% |
| 1 | 5306 | 2.4% |
| Other values (72) | 142841 |
| Value | Count | Frequency (%) |
| 1 | 5306 | |
| 2 | 13003 | |
| 3 | 9214 | |
| 5 | 9096 | |
| 7 | 2985 | 1.3% |
| 8 | 2965 | 1.3% |
| 9 | 2354 | 1.0% |
| 10 | 3848 | 1.7% |
| 11 | 4078 | 1.8% |
| 13 | 2889 | 1.3% |
| Value | Count | Frequency (%) |
| 261 | 176 | 0.1% |
| 260 | 339 | 0.2% |
| 259 | 345 | 0.2% |
| 258 | 374 | 0.2% |
| 257 | 1256 | 0.6% |
| 255 | 1562 | |
| 254 | 1699 | |
| 251 | 3842 | |
| 250 | 1442 | 0.6% |
| 249 | 854 | 0.4% |
supplier_id
Real number (ℝ≥0)
| Distinct | 2945 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19645.59789 |
| Minimum | 10524 |
| Maximum | 24803 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 10524 |
|---|---|
| 5-th percentile | 14181 |
| Q1 | 16555 |
| median | 20333 |
| Q3 | 23004 |
| 95-th percentile | 24124 |
| Maximum | 24803 |
| Range | 14279 |
| Interquartile range (IQR) | 6449 |
Descriptive statistics
| Standard deviation | 3494.023799 |
|---|---|
| Coefficient of variation (CV) | 0.1778527596 |
| Kurtosis | -1.478569771 |
| Mean | 19645.59789 |
| Median Absolute Deviation (MAD) | 3061 |
| Skewness | -0.1710645231 |
| Sum | 4429944805 |
| Variance | 12208202.31 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18317 | 1319 | 0.6% |
| 17980 | 1252 | 0.6% |
| 14234 | 1241 | 0.6% |
| 15663 | 1237 | 0.5% |
| 15694 | 1154 | 0.5% |
| 18166 | 1145 | 0.5% |
| 14375 | 1103 | 0.5% |
| 14115 | 1042 | 0.5% |
| 14145 | 1037 | 0.5% |
| 22727 | 1012 | 0.4% |
| Other values (2935) | 213951 |
| Value | Count | Frequency (%) |
| 10524 | 6 | < 0.1% |
| 12311 | 3 | < 0.1% |
| 12312 | 38 | |
| 12374 | 86 | |
| 12441 | 47 | |
| 12456 | 68 | |
| 12500 | 54 | |
| 12534 | 55 | |
| 12539 | 7 | < 0.1% |
| 12797 | 61 |
| Value | Count | Frequency (%) |
| 24803 | 2 | |
| 24802 | 2 | |
| 24799 | 1 | < 0.1% |
| 24797 | 2 | |
| 24794 | 1 | < 0.1% |
| 24793 | 1 | < 0.1% |
| 24790 | 1 | < 0.1% |
| 24789 | 1 | < 0.1% |
| 24787 | 2 | |
| 24785 | 3 |
manufacturer_id
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.07225058 |
| Minimum | 45 |
| Maximum | 156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 45 |
|---|---|
| 5-th percentile | 45 |
| Q1 | 48 |
| median | 86 |
| Q3 | 86 |
| 95-th percentile | 86 |
| Maximum | 156 |
| Range | 111 |
| Interquartile range (IQR) | 38 |
Descriptive statistics
| Standard deviation | 22.16467967 |
|---|---|
| Coefficient of variation (CV) | 0.3208912332 |
| Kurtosis | -0.7184451107 |
| Mean | 69.07225058 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | 0.3871002259 |
| Sum | 15575309 |
| Variance | 491.2730247 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 86 | 106062 | |
| 45 | 55207 | |
| 51 | 26243 | 11.6% |
| 48 | 15721 | 7.0% |
| 49 | 9700 | 4.3% |
| 120 | 9417 | 4.2% |
| 67 | 2366 | 1.0% |
| 145 | 760 | 0.3% |
| 153 | 11 | < 0.1% |
| 152 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 45 | 55207 | |
| 48 | 15721 | 7.0% |
| 49 | 9700 | 4.3% |
| 51 | 26243 | 11.6% |
| 67 | 2366 | 1.0% |
| 86 | 106062 | |
| 120 | 9417 | 4.2% |
| 145 | 760 | 0.3% |
| 152 | 5 | < 0.1% |
| 153 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 156 | 1 | < 0.1% |
| 153 | 11 | < 0.1% |
| 152 | 5 | < 0.1% |
| 145 | 760 | 0.3% |
| 120 | 9417 | 4.2% |
| 86 | 106062 | |
| 67 | 2366 | 1.0% |
| 51 | 26243 | 11.6% |
| 49 | 9700 | 4.3% |
| 48 | 15721 | 7.0% |
Current_pincode_ID
Real number (ℝ≥0)
| Distinct | 6659 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3375.718133 |
| Minimum | 1 |
| Maximum | 7345 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 234 |
| Q1 | 1509 |
| median | 2949 |
| Q3 | 5682 |
| 95-th percentile | 6944 |
| Maximum | 7345 |
| Range | 7344 |
| Interquartile range (IQR) | 4173 |
Descriptive statistics
| Standard deviation | 2253.216519 |
|---|---|
| Coefficient of variation (CV) | 0.6674776834 |
| Kurtosis | -1.292294566 |
| Mean | 3375.718133 |
| Median Absolute Deviation (MAD) | 1902 |
| Skewness | 0.2942958703 |
| Sum | 761200809 |
| Variance | 5076984.684 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2578 | 1852 | 0.8% |
| 1446 | 1651 | 0.7% |
| 1515 | 1044 | 0.5% |
| 2989 | 838 | 0.4% |
| 2943 | 834 | 0.4% |
| 1509 | 821 | 0.4% |
| 2782 | 818 | 0.4% |
| 1794 | 794 | 0.4% |
| 571 | 781 | 0.3% |
| 3363 | 727 | 0.3% |
| Other values (6649) | 215333 |
| Value | Count | Frequency (%) |
| 1 | 26 | < 0.1% |
| 2 | 72 | < 0.1% |
| 3 | 50 | < 0.1% |
| 4 | 87 | |
| 5 | 215 | |
| 6 | 100 | |
| 7 | 104 | |
| 8 | 43 | < 0.1% |
| 9 | 29 | < 0.1% |
| 10 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 7345 | 7 | < 0.1% |
| 7344 | 1 | < 0.1% |
| 7343 | 2 | < 0.1% |
| 7342 | 1 | < 0.1% |
| 7341 | 7 | < 0.1% |
| 7340 | 2 | < 0.1% |
| 7338 | 2 | < 0.1% |
| 7337 | 3 | < 0.1% |
| 7336 | 20 | |
| 7335 | 1 | < 0.1% |
Date.of.Birth
Date
| Distinct | 14417 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| Minimum | 1972-01-01 00:00:00 |
| Maximum | 2071-12-31 00:00:00 |
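A maximum birth date of 2071-12-31 is later than the analysis date. That suggests the source stored two-digit years and some were expanded into the wrong century during parsing, though this is an inference and not something the report states. One possible repair, sketched below under that assumption, shifts any birth date that falls after a reference date back by 100 years. The name `fix_century` and the choice of reference date are hypothetical.

```python
from datetime import date


def fix_century(born: date, reference: date) -> date:
    """Move a birth date that lies after `reference` back by 100 years."""
    if born <= reference:
        return born
    # Assumes the error is exactly one century, as with two-digit years.
    try:
        return born.replace(year=born.year - 100)
    except ValueError:
        # Feb 29 does not exist in the shifted year; fall back to Feb 28.
        return born.replace(year=born.year - 100, day=28)


# Assumed cutoff: the latest 2018 date that appears elsewhere in this report.
reference = date(2018, 12, 10)
print(fix_century(date(2071, 12, 31), reference))  # 1971-12-31
print(fix_century(date(1972, 1, 1), reference))    # 1972-01-01
```

Applied to the extremes in the table, this would give a birth-date range that lies entirely before the loans were disbursed. Whether that matches the true data would need to be confirmed against the raw file.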
Employment.Type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| Self employed | 127635 |
|---|---|
| Salaried | 97858 |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.8301322 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2442119 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Salaried |
|---|---|
| 2nd row | Self employed |
| 3rd row | Self employed |
| 4th row | Self employed |
| 5th row | Self employed |
Common Values
| Value | Count | Frequency (%) |
| Self employed | 127635 | 56.6% |
| Salaried | 97858 | 43.4% |
Length and Category Frequency Plot (figures not reproduced)
Words
| Value | Count | Frequency (%) |
| self | 127635 | 36.1% |
| employed | 127635 | 36.1% |
| salaried | 97858 | 27.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 480763 | 19.7% |
| l | 353128 | 14.5% |
| S | 225493 | 9.2% |
| d | 225493 | 9.2% |
| a | 195716 | 8.0% |
| f | 127635 | 5.2% |
| (space) | 127635 | 5.2% |
| m | 127635 | 5.2% |
| p | 127635 | 5.2% |
| o | 127635 | 5.2% |
| Other values (3) | 323351 | 13.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2088991 | 85.5% |
| Uppercase Letter | 225493 | 9.2% |
| Space Separator | 127635 | 5.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 480763 | 23.0% |
| l | 353128 | 16.9% |
| d | 225493 | 10.8% |
| a | 195716 | 9.4% |
| f | 127635 | 6.1% |
| m | 127635 | 6.1% |
| p | 127635 | 6.1% |
| o | 127635 | 6.1% |
| y | 127635 | 6.1% |
| r | 97858 | 4.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 225493 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 127635 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2314484 | 94.8% |
| Common | 127635 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 480763 | 20.8% |
| l | 353128 | 15.3% |
| S | 225493 | 9.7% |
| d | 225493 | 9.7% |
| a | 195716 | 8.5% |
| f | 127635 | 5.5% |
| m | 127635 | 5.5% |
| p | 127635 | 5.5% |
| o | 127635 | 5.5% |
| y | 127635 | 5.5% |
| Other values (2) | 195716 | 8.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 127635 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2442119 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 480763 | 19.7% |
| l | 353128 | 14.5% |
| S | 225493 | 9.2% |
| d | 225493 | 9.2% |
| a | 195716 | 8.0% |
| f | 127635 | 5.2% |
| (space) | 127635 | 5.2% |
| m | 127635 | 5.2% |
| p | 127635 | 5.2% |
| o | 127635 | 5.2% |
| Other values (3) | 323351 | 13.2% |
DisbursalDate
Date
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| Minimum | 2018-01-08 00:00:00 |
| Maximum | 2018-12-10 00:00:00 |
State_ID
Real number (ℝ≥0)
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.241550735 |
| Minimum | 1 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 6 |
| Q3 | 10 |
| 95-th percentile | 16 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.460856315 |
|---|---|
| Coefficient of variation (CV) | 0.6160084321 |
| Kurtosis | -0.2978858737 |
| Mean | 7.241550735 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.8287275014 |
| Sum | 1632919 |
| Variance | 19.89923906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 44234 | |
| 6 | 32958 | |
| 3 | 31640 | |
| 13 | 17858 | |
| 9 | 15690 | 7.0% |
| 8 | 13193 | 5.9% |
| 5 | 10046 | 4.5% |
| 1 | 8922 | 4.0% |
| 14 | 8169 | 3.6% |
| 7 | 6722 | 3.0% |
| Other values (12) | 36061 |
| Value | Count | Frequency (%) |
| 1 | 8922 | 4.0% |
| 2 | 4049 | 1.8% |
| 3 | 31640 | |
| 4 | 44234 | |
| 5 | 10046 | 4.5% |
| 6 | 32958 | |
| 7 | 6722 | 3.0% |
| 8 | 13193 | 5.9% |
| 9 | 15690 | 7.0% |
| 10 | 3465 | 1.5% |
| Value | Count | Frequency (%) |
| 22 | 75 | < 0.1% |
| 21 | 152 | 0.1% |
| 20 | 182 | 0.1% |
| 19 | 952 | 0.4% |
| 18 | 5406 | 2.4% |
| 17 | 3352 | 1.5% |
| 16 | 2667 | 1.2% |
| 15 | 5032 | 2.2% |
| 14 | 8169 | |
| 13 | 17858 |
Employee_code_ID
Real number (ℝ≥0)
| Distinct | 3269 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1550.665453 |
| Minimum | 1 |
| Maximum | 3795 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 149 |
| Q1 | 713 |
| median | 1452 |
| Q3 | 2365 |
| 95-th percentile | 3187 |
| Maximum | 3795 |
| Range | 3794 |
| Interquartile range (IQR) | 1652 |
Descriptive statistics
| Standard deviation | 975.664631 |
|---|---|
| Coefficient of variation (CV) | 0.6291909252 |
| Kurtosis | -1.054844758 |
| Mean | 1550.665453 |
| Median Absolute Deviation (MAD) | 813 |
| Skewness | 0.2423795862 |
| Sum | 349664205 |
| Variance | 951921.4722 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2546 | 628 | 0.3% |
| 620 | 502 | 0.2% |
| 255 | 492 | 0.2% |
| 130 | 408 | 0.2% |
| 2153 | 401 | 0.2% |
| 1466 | 355 | 0.2% |
| 1494 | 352 | 0.2% |
| 64 | 349 | 0.2% |
| 751 | 343 | 0.2% |
| 184 | 340 | 0.2% |
| Other values (3259) | 221323 |
| Value | Count | Frequency (%) |
| 1 | 80 | |
| 3 | 132 | |
| 4 | 67 | |
| 5 | 88 | |
| 7 | 144 | |
| 9 | 56 | < 0.1% |
| 10 | 44 | < 0.1% |
| 11 | 85 | |
| 12 | 119 | |
| 15 | 83 |
| Value | Count | Frequency (%) |
| 3795 | 1 | < 0.1% |
| 3794 | 1 | < 0.1% |
| 3793 | 1 | < 0.1% |
| 3792 | 1 | < 0.1% |
| 3791 | 3 | |
| 3790 | 1 | < 0.1% |
| 3789 | 2 | |
| 3788 | 1 | < 0.1% |
| 3787 | 2 | |
| 3786 | 3 |
Aadhar_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 220.3 KiB |
| True | 188900 |
|---|---|
| False | 36593 |
| Value | Count | Frequency (%) |
| True | 188900 | 83.8% |
| False | 36593 | 16.2% |
PAN_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 220.3 KiB |
| False | 208043 |
|---|---|
| True | 17450 |
| Value | Count | Frequency (%) |
| False | 208043 | 92.3% |
| True | 17450 | 7.7% |
VoterID_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 220.3 KiB |
| False | 192317 |
|---|---|
| True | 33176 |
| Value | Count | Frequency (%) |
| False | 192317 | 85.3% |
| True | 33176 | 14.7% |
Driving_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 220.3 KiB |
| False | 220152 |
|---|---|
| True | 5341 |
| Value | Count | Frequency (%) |
| False | 220152 | 97.6% |
| True | 5341 | 2.4% |
Passport_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 220.3 KiB |
| False | 225011 |
|---|---|
| True | 482 |
| Value | Count | Frequency (%) |
| False | 225011 | 99.8% |
| True | 482 | 0.2% |
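The Frequency (%) column is each count divided by the number of observations, rounded to one decimal, with shares that would round to zero shown as `< 0.1%`. The formatter below is a guess at that display rule based on the rows in this report, not pandas-profiling's actual code; `format_share` is a hypothetical name.

```python
N_OBSERVATIONS = 225493  # from the dataset statistics


def format_share(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format count/total the way the Frequency (%) column appears to."""
    text = f"{100 * count / total:.1f}"
    if count > 0 and text == "0.0":
        return "< 0.1%"  # non-zero share too small to show at one decimal
    return f"{text}%"


for label, count in [("False", 225011), ("True", 482), ("single row", 1)]:
    print(label, format_share(count))
```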
PERFORM_CNS.SCORE
Real number (ℝ≥0)
| Distinct | 573 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 293.0404491 |
| Minimum | 0 |
| Maximum | 890 |
| Zeros | 111773 |
| Zeros (%) | 49.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 15 |
| Q3 | 680 |
| 95-th percentile | 825 |
| Maximum | 890 |
| Range | 890 |
| Interquartile range (IQR) | 680 |
Descriptive statistics
| Standard deviation | 338.8747837 |
|---|---|
| Coefficient of variation (CV) | 1.156409583 |
| Kurtosis | -1.65222874 |
| Mean | 293.0404491 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.4244506554 |
| Sum | 66078570 |
| Variance | 114836.119 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 111773 | |
| 300 | 8632 | 3.8% |
| 738 | 8473 | 3.8% |
| 825 | 7196 | 3.2% |
| 15 | 3671 | 1.6% |
| 17 | 3557 | 1.6% |
| 763 | 2972 | 1.3% |
| 16 | 2815 | 1.2% |
| 708 | 2062 | 0.9% |
| 737 | 1943 | 0.9% |
| Other values (563) | 72399 |
| Value | Count | Frequency (%) |
| 0 | 111773 | |
| 11 | 3 | < 0.1% |
| 14 | 957 | 0.4% |
| 15 | 3671 | 1.6% |
| 16 | 2815 | 1.2% |
| 17 | 3557 | 1.6% |
| 18 | 1477 | 0.7% |
| 300 | 8632 | 3.8% |
| 301 | 9 | < 0.1% |
| 302 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 890 | 4 | < 0.1% |
| 884 | 1 | < 0.1% |
| 879 | 59 | |
| 878 | 7 | < 0.1% |
| 873 | 9 | < 0.1% |
| 870 | 28 | |
| 869 | 7 | < 0.1% |
| 868 | 2 | < 0.1% |
| 867 | 1 | < 0.1% |
| 864 | 8 | < 0.1% |
PERFORM_CNS.SCORE.DESCRIPTION
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| No Bureau History Available | 111773 |
|---|---|
| C-Very Low Risk | 15715 |
| A-Very Low Risk | 13790 |
| D-Very Low Risk | 11134 |
| B-Very Low Risk | 9032 |
| Other values (15) | 64049 |
Length
| Max length | 55 |
|---|---|
| Median length | 53 |
| Mean length | 22.13364938 |
| Min length | 10 |
Characters and Unicode
| Total characters | 4990983 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Bureau History Available |
|---|---|
| 2nd row | I-Medium Risk |
| 3rd row | No Bureau History Available |
| 4th row | L-Very High Risk |
| 5th row | No Bureau History Available |
Common Values
| Value | Count | Frequency (%) |
| No Bureau History Available | 111773 | |
| C-Very Low Risk | 15715 | 7.0% |
| A-Very Low Risk | 13790 | 6.1% |
| D-Very Low Risk | 11134 | 4.9% |
| B-Very Low Risk | 9032 | 4.0% |
| M-Very High Risk | 8632 | 3.8% |
| F-Low Risk | 8309 | 3.7% |
| K-High Risk | 8107 | 3.6% |
| H-Medium Risk | 6695 | 3.0% |
| E-Low Risk | 5695 | 2.5% |
| Other values (10) | 26611 | 11.8% |
Length (figure not reproduced)
Words
| Value | Count | Frequency (%) |
| available | 120478 | |
| no | 116065 | |
| history | 115444 | |
| bureau | 111773 | |
| risk | 101240 | |
| low | 49671 | 6.2% |
| not | 19708 | 2.4% |
| c-very | 15715 | 1.9% |
| a-very | 13790 | 1.7% |
| scored | 12480 | 1.5% |
| Other values (35) | 130109 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 580980 | 11.6% |
| i | 388092 | 7.8% |
| a | 366409 | 7.3% |
| o | 353575 | 7.1% |
| e | 342634 | 6.9% |
| r | 307411 | 6.2% |
| u | 250244 | 5.0% |
| l | 243390 | 4.9% |
| s | 230305 | 4.6% |
| y | 178641 | 3.6% |
| Other values (40) | 1749302 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3413822 | |
| Uppercase Letter | 873871 | 17.5% |
| Space Separator | 580980 | 11.6% |
| Dash Punctuation | 101240 | 2.0% |
| Other Punctuation | 12480 | 0.3% |
| Decimal Number | 2960 | 0.1% |
| Open Punctuation | 2815 | 0.1% |
| Close Punctuation | 2815 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 388092 | |
| a | 366409 | |
| o | 353575 | |
| e | 342634 | |
| r | 307411 | |
| u | 250244 | |
| l | 243390 | |
| s | 230305 | 6.7% |
| y | 178641 | 5.2% |
| t | 165409 | 4.8% |
| Other values (12) | 587712 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 143667 | |
| N | 135773 | |
| A | 132052 | |
| B | 120805 | |
| R | 101240 | |
| L | 68699 | |
| V | 59425 | |
| M | 20770 | 2.4% |
| S | 16151 | 1.8% |
| C | 15715 | 1.8% |
| Other values (9) | 59574 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1477 | |
| 6 | 1477 | |
| 5 | 3 | 0.1% |
| 0 | 3 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 580980 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 101240 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 12480 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2815 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2815 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4287693 | |
| Common | 703290 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 388092 | 9.1% |
| a | 366409 | 8.5% |
| o | 353575 | 8.2% |
| e | 342634 | 8.0% |
| r | 307411 | 7.2% |
| u | 250244 | 5.8% |
| l | 243390 | 5.7% |
| s | 230305 | 5.4% |
| y | 178641 | 4.2% |
| t | 165409 | 3.9% |
| Other values (31) | 1461583 | 34.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 580980 | 82.6% |
| - | 101240 | 14.4% |
| : | 12480 | 1.8% |
| ( | 2815 | 0.4% |
| ) | 2815 | 0.4% |
| 3 | 1477 | 0.2% |
| 6 | 1477 | 0.2% |
| 5 | 3 | < 0.1% |
| 0 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4990983 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 580980 | 11.6% |
| i | 388092 | 7.8% |
| a | 366409 | 7.3% |
| o | 353575 | 7.1% |
| e | 342634 | 6.9% |
| r | 307411 | 6.2% |
| u | 250244 | 5.0% |
| l | 243390 | 4.9% |
| s | 230305 | 4.6% |
| y | 178641 | 3.6% |
| Other values (40) | 1749302 | 35.0% |
| Distinct | 107 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.462360251 |
| Minimum | 0 |
|---|---|
| Maximum | 453 |
| Zeros | 111773 |
| Zeros (%) | 49.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 11 |
| Maximum | 453 |
| Range | 453 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 5.223011518 |
|---|---|
| Coefficient of variation (CV) | 2.121140283 |
| Kurtosis | 426.083019 |
| Mean | 2.462360251 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 9.857231997 |
| Sum | 555245 |
| Variance | 27.27984932 |
| Monotonicity | Not monotonic |
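Several of the descriptive statistics above are simple functions of one another, so they can be cross-checked. The sketch below is only an illustration: it copies the mean, standard deviation, quartiles and extremes from the tables above, takes the observation count (225493) from the dataset overview, and recomputes the coefficient of variation, variance, sum, IQR and range. The variable names and the relative tolerance are assumptions chosen for this example.

```python
import math

# Values copied from the report tables above.
n_obs = 225493
mean = 2.462360251
std = 5.223011518
q1, q3 = 0, 3
minimum, maximum = 0, 453

# Quantities that follow directly from the values above.
derived = {
    "cv": std / mean,               # coefficient of variation
    "variance": std ** 2,           # variance is the squared standard deviation
    "sum": round(mean * n_obs),     # sum is mean times number of observations
    "iqr": q3 - q1,                 # interquartile range
    "range": maximum - minimum,
}

# The same quantities as printed in the report.
reported = {
    "cv": 2.121140283,
    "variance": 27.27984932,
    "sum": 555245,
    "iqr": 3,
    "range": 453,
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.6g} reported={reported[name]} match={ok}")
```

The same check can be repeated for any other numeric variable in the report by swapping in its values.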
| Value | Count | Frequency (%) |
| 0 | 111773 | 49.6% |
| 1 | 34154 | 15.1% |
| 2 | 19426 | 8.6% |
| 3 | 12787 | 5.7% |
| 4 | 9159 | 4.1% |
| 5 | 7079 | 3.1% |
| 6 | 5462 | 2.4% |
| 7 | 4332 | 1.9% |
| 8 | 3488 | 1.5% |
| 9 | 2815 | 1.2% |
| Other values (97) | 15018 | 6.7% |
| Value | Count | Frequency (%) |
| 0 | 111773 | 49.6% |
| 1 | 34154 | 15.1% |
| 2 | 19426 | 8.6% |
| 3 | 12787 | 5.7% |
| 4 | 9159 | 4.1% |
| 5 | 7079 | 3.1% |
| 6 | 5462 | 2.4% |
| 7 | 4332 | 1.9% |
| 8 | 3488 | 1.5% |
| 9 | 2815 | 1.2% |
| Value | Count | Frequency (%) |
| 453 | 1 | < 0.1% |
| 354 | 1 | < 0.1% |
| 271 | 1 | < 0.1% |
| 194 | 1 | < 0.1% |
| 148 | 2 | < 0.1% |
| 147 | 1 | < 0.1% |
| 136 | 1 | < 0.1% |
| 132 | 1 | < 0.1% |
| 131 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.053766636 |
| Minimum | 0 |
|---|---|
| Maximum | 144 |
| Zeros | 131395 |
| Zeros (%) | 58.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 144 |
| Range | 144 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.95201484 |
|---|---|
| Coefficient of variation (CV) | 1.852416629 |
| Kurtosis | 156.3509839 |
| Mean | 1.053766636 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.376997268 |
| Sum | 237617 |
| Variance | 3.810361934 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 131395 | 58.3% |
| 1 | 41050 | 18.2% |
| 2 | 21138 | 9.4% |
| 3 | 12044 | 5.3% |
| 4 | 7306 | 3.2% |
| 5 | 4447 | 2.0% |
| 6 | 2741 | 1.2% |
| 7 | 1766 | 0.8% |
| 8 | 1171 | 0.5% |
| 9 | 740 | 0.3% |
| Other values (30) | 1695 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 131395 | 58.3% |
| 1 | 41050 | 18.2% |
| 2 | 21138 | 9.4% |
| 3 | 12044 | 5.3% |
| 4 | 7306 | 3.2% |
| 5 | 4447 | 2.0% |
| 6 | 2741 | 1.2% |
| 7 | 1766 | 0.8% |
| 8 | 1171 | 0.5% |
| 9 | 740 | 0.3% |
| Value | Count | Frequency (%) |
| 144 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 37 | 2 | < 0.1% |
| 35 | 2 | < 0.1% |
| 34 | 2 | < 0.1% |
| 32 | 2 | < 0.1% |
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1589894143 |
| Minimum | 0 |
|---|---|
| Maximum | 25 |
| Zeros | 199703 |
| Zeros (%) | 88.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 25 |
| Range | 25 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5534152006 |
|---|---|
| Coefficient of variation (CV) | 3.480830488 |
| Kurtosis | 124.8961637 |
| Mean | 0.1589894143 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.486695923 |
| Sum | 35851 |
| Variance | 0.3062683843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 199703 | 88.6% |
| 1 | 19596 | 8.7% |
| 2 | 4226 | 1.9% |
| 3 | 1175 | 0.5% |
| 4 | 399 | 0.2% |
| 5 | 165 | 0.1% |
| 6 | 96 | < 0.1% |
| 7 | 38 | < 0.1% |
| 8 | 26 | < 0.1% |
| 9 | 24 | < 0.1% |
| Other values (12) | 45 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 199703 | 88.6% |
| 1 | 19596 | 8.7% |
| 2 | 4226 | 1.9% |
| 3 | 1175 | 0.5% |
| 4 | 399 | 0.2% |
| 5 | 165 | 0.1% |
| 6 | 96 | < 0.1% |
| 7 | 38 | < 0.1% |
| 8 | 26 | < 0.1% |
| 9 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 25 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 2 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 5 | < 0.1% |
| 13 | 5 | < 0.1% |
| 12 | 8 | < 0.1% |
| Distinct | 70044 |
|---|---|
| Distinct (%) | 31.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 168481.3163 |
| Minimum | -6678296 |
|---|---|
| Maximum | 96524920 |
| Zeros | 136011 |
| Zeros (%) | 60.3% |
| Negative | 436 |
| Negative (%) | 0.2% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | -6678296 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 36300 |
| 95-th percentile | 817893.4 |
| Maximum | 96524920 |
| Range | 103203216 |
| Interquartile range (IQR) | 36300 |
Descriptive statistics
| Standard deviation | 951669.1721 |
|---|---|
| Coefficient of variation (CV) | 5.648514584 |
| Kurtosis | 1597.186355 |
| Mean | 168481.3163 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.25624693 |
| Sum | 3.799135745 × 10^10 |
| Variance | 9.056742132 × 10^11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 136011 | 60.3% |
| 800 | 120 | 0.1% |
| 400 | 119 | 0.1% |
| 30000 | 99 | < 0.1% |
| 100000 | 80 | < 0.1% |
| 50000 | 80 | < 0.1% |
| 40000 | 72 | < 0.1% |
| 25000 | 72 | < 0.1% |
| 20000 | 62 | < 0.1% |
| 60000 | 61 | < 0.1% |
| Other values (70034) | 88717 | 39.3% |
| Value | Count | Frequency (%) |
| -6678296 | 1 | < 0.1% |
| -2018309 | 1 | < 0.1% |
| -1738415 | 1 | < 0.1% |
| -1408314 | 1 | < 0.1% |
| -1306449 | 1 | < 0.1% |
| -1178242 | 1 | < 0.1% |
| -1108114 | 1 | < 0.1% |
| -931644 | 1 | < 0.1% |
| -763599 | 1 | < 0.1% |
| -754060 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 96524920 | 1 | < 0.1% |
| 75603400 | 1 | < 0.1% |
| 66406160 | 1 | < 0.1% |
| 63531320 | 1 | < 0.1% |
| 63359040 | 1 | < 0.1% |
| 61367688 | 1 | < 0.1% |
| 56385824 | 1 | < 0.1% |
| 56163544 | 1 | < 0.1% |
| 52503152 | 1 | < 0.1% |
| 52367960 | 1 | < 0.1% |
| Distinct | 43743 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 222073.6394 |
| Minimum | 0 |
|---|---|
| Maximum | 1000000000 |
| Zeros | 132449 |
| Zeros (%) | 58.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 64900 |
| 95-th percentile | 1046668.2 |
| Maximum | 1000000000 |
| Range | 1000000000 |
| Interquartile range (IQR) | 64900 |
Descriptive statistics
| Standard deviation | 2411721.515 |
|---|---|
| Coefficient of variation (CV) | 10.86000806 |
| Kurtosis | 131068.3347 |
| Mean | 222073.6394 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 319.533663 |
| Sum | 5.007605118 × 10^10 |
| Variance | 5.816400667 × 10^12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 132449 | 58.7% |
| 50000 | 1456 | 0.6% |
| 30000 | 1406 | 0.6% |
| 100000 | 942 | 0.4% |
| 25000 | 935 | 0.4% |
| 40000 | 843 | 0.4% |
| 20000 | 824 | 0.4% |
| 200000 | 593 | 0.3% |
| 60000 | 586 | 0.3% |
| 15000 | 553 | 0.2% |
| Other values (43733) | 84906 | 37.7% |
| Value | Count | Frequency (%) |
| 0 | 132449 | 58.7% |
| 1 | 35 | < 0.1% |
| 2 | 24 | < 0.1% |
| 3 | 20 | < 0.1% |
| 4 | 20 | < 0.1% |
| 5 | 15 | < 0.1% |
| 6 | 9 | < 0.1% |
| 7 | 13 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000000000 | 1 | < 0.1% |
| 105865712 | 1 | < 0.1% |
| 100425000 | 1 | < 0.1% |
| 92622816 | 1 | < 0.1% |
| 86323888 | 1 | < 0.1% |
| 80327560 | 1 | < 0.1% |
| 79012752 | 1 | < 0.1% |
| 76128712 | 1 | < 0.1% |
| 69847456 | 1 | < 0.1% |
| 69828000 | 1 | < 0.1% |
| Distinct | 47206 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 221609.8144 |
| Minimum | 0 |
|---|---|
| Maximum | 1000000000 |
| Zeros | 132559 |
| Zeros (%) | 58.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 62990 |
| 95-th percentile | 1042532 |
| Maximum | 1000000000 |
| Range | 1000000000 |
| Interquartile range (IQR) | 62990 |
Descriptive statistics
| Standard deviation | 2414697.439 |
|---|---|
| Coefficient of variation (CV) | 10.89616652 |
| Kurtosis | 130424.4242 |
| Mean | 221609.8144 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 318.4004683 |
| Sum | 4.997146188 × 10^10 |
| Variance | 5.830763724 × 10^12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 132559 | 58.8% |
| 50000 | 1354 | 0.6% |
| 30000 | 1302 | 0.6% |
| 100000 | 917 | 0.4% |
| 40000 | 764 | 0.3% |
| 25000 | 739 | 0.3% |
| 20000 | 638 | 0.3% |
| 200000 | 600 | 0.3% |
| 300000 | 539 | 0.2% |
| 60000 | 518 | 0.2% |
| Other values (47196) | 85563 | 37.9% |
| Value | Count | Frequency (%) |
| 0 | 132559 | 58.8% |
| 1 | 44 | < 0.1% |
| 2 | 25 | < 0.1% |
| 3 | 20 | < 0.1% |
| 4 | 19 | < 0.1% |
| 5 | 15 | < 0.1% |
| 6 | 9 | < 0.1% |
| 7 | 13 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000000000 | 1 | < 0.1% |
| 105755712 | 1 | < 0.1% |
| 100425000 | 1 | < 0.1% |
| 92628728 | 1 | < 0.1% |
| 86024784 | 1 | < 0.1% |
| 80349168 | 1 | < 0.1% |
| 79012752 | 1 | < 0.1% |
| 76128712 | 1 | < 0.1% |
| 69847456 | 1 | < 0.1% |
| 69715944 | 1 | < 0.1% |
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06012160023 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 219731 |
| Zeros (%) | 97.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6331042657 |
|---|---|
| Coefficient of variation (CV) | 10.53039612 |
| Kurtosis | 1268.943982 |
| Mean | 0.06012160023 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.84235193 |
| Sum | 13557 |
| Variance | 0.4008210112 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 219731 | 97.4% |
| 1 | 3396 | 1.5% |
| 2 | 1022 | 0.5% |
| 3 | 438 | 0.2% |
| 4 | 289 | 0.1% |
| 5 | 147 | 0.1% |
| 6 | 115 | 0.1% |
| 7 | 75 | < 0.1% |
| 8 | 67 | < 0.1% |
| 9 | 37 | < 0.1% |
| Other values (27) | 176 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 219731 | 97.4% |
| 1 | 3396 | 1.5% |
| 2 | 1022 | 0.5% |
| 3 | 438 | 0.2% |
| 4 | 289 | 0.1% |
| 5 | 147 | 0.1% |
| 6 | 115 | 0.1% |
| 7 | 75 | < 0.1% |
| 8 | 67 | < 0.1% |
| 9 | 37 | < 0.1% |
| Value | Count | Frequency (%) |
| 52 | 1 | < 0.1% |
| 46 | 2 | < 0.1% |
| 42 | 1 | < 0.1% |
| 38 | 2 | < 0.1% |
| 37 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 34 | 2 | < 0.1% |
| 31 | 4 | < 0.1% |
| 30 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02821373612 |
| Minimum | 0 |
|---|---|
| Maximum | 36 |
| Zeros | 221737 |
| Zeros (%) | 98.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3189458665 |
|---|---|
| Coefficient of variation (CV) | 11.30463066 |
| Kurtosis | 1753.782524 |
| Mean | 0.02821373612 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.4096604 |
| Sum | 6362 |
| Variance | 0.1017264657 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 221737 | 98.3% |
| 1 | 2637 | 1.2% |
| 2 | 627 | 0.3% |
| 3 | 193 | 0.1% |
| 4 | 116 | 0.1% |
| 5 | 64 | < 0.1% |
| 6 | 32 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 10 | < 0.1% |
| Other values (13) | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 221737 | 98.3% |
| 1 | 2637 | 1.2% |
| 2 | 627 | 0.3% |
| 3 | 193 | 0.1% |
| 4 | 116 | 0.1% |
| 5 | 64 | < 0.1% |
| 6 | 32 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 36 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 22 | 2 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 4 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 3 | < 0.1% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00736164759 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 224183 |
| Zeros (%) | 99.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1123006844 |
|---|---|
| Coefficient of variation (CV) | 15.25483025 |
| Kurtosis | 855.9649875 |
| Mean | 0.00736164759 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.01431522 |
| Sum | 1660 |
| Variance | 0.01261144371 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 224183 | 99.4% |
| 1 | 1104 | 0.5% |
| 2 | 124 | 0.1% |
| 3 | 47 | < 0.1% |
| 4 | 19 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 6 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 224183 | 99.4% |
| 1 | 1104 | 0.5% |
| 2 | 124 | 0.1% |
| 3 | 47 | < 0.1% |
| 4 | 19 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 6 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 6 | < 0.1% |
| 5 | 8 | < 0.1% |
| 4 | 19 | < 0.1% |
| 3 | 47 | < 0.1% |
| 2 | 124 | 0.1% |
| 1 | 1104 | 0.5% |
| 0 | 224183 | 99.4% |
| Distinct | 3197 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5569.681853 |
| Minimum | -574647 |
|---|---|
| Maximum | 36032852 |
| Zeros | 222182 |
| Zeros (%) | 98.5% |
| Negative | 60 |
| Negative (%) | < 0.1% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | -574647 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 36032852 |
| Range | 36607499 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 172928.1293 |
|---|---|
| Coefficient of variation (CV) | 31.04811619 |
| Kurtosis | 16744.54951 |
| Mean | 5569.681853 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 107.0091863 |
| Sum | 1255924270 |
| Variance | 2.990413791 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 222182 | 98.5% |
| 800 | 10 | < 0.1% |
| 100 | 8 | < 0.1% |
| 400 | 8 | < 0.1% |
| 1200 | 6 | < 0.1% |
| 589 | 5 | < 0.1% |
| -1 | 5 | < 0.1% |
| 1600 | 4 | < 0.1% |
| 1070 | 4 | < 0.1% |
| 1 | 4 | < 0.1% |
| Other values (3187) | 3257 | 1.4% |
| Value | Count | Frequency (%) |
| -574647 | 1 | < 0.1% |
| -239782 | 1 | < 0.1% |
| -155527 | 1 | < 0.1% |
| -117138 | 1 | < 0.1% |
| -31290 | 1 | < 0.1% |
| -20000 | 1 | < 0.1% |
| -9625 | 1 | < 0.1% |
| -8606 | 1 | < 0.1% |
| -7730 | 1 | < 0.1% |
| -7370 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 36032852 | 1 | < 0.1% |
| 29560540 | 1 | < 0.1% |
| 24692024 | 1 | < 0.1% |
| 22497172 | 1 | < 0.1% |
| 19638280 | 1 | < 0.1% |
| 13607882 | 1 | < 0.1% |
| 12080102 | 1 | < 0.1% |
| 10779261 | 1 | < 0.1% |
| 10716039 | 1 | < 0.1% |
| 9801134 | 1 | < 0.1% |
| Distinct | 2195 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7489.1866 |
| Minimum | 0 |
|---|---|
| Maximum | 30000000 |
| Zeros | 221816 |
| Zeros (%) | 98.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 30000000 |
| Range | 30000000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 186043.2484 |
|---|---|
| Coefficient of variation (CV) | 24.84158272 |
| Kurtosis | 8423.825453 |
| Mean | 7489.1866 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 74.21689332 |
| Sum | 1688759154 |
| Variance | 3.461209027 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 221816 | 98.4% |
| 50000 | 82 | < 0.1% |
| 100000 | 60 | < 0.1% |
| 30000 | 43 | < 0.1% |
| 200000 | 38 | < 0.1% |
| 40000 | 37 | < 0.1% |
| 15000 | 36 | < 0.1% |
| 25000 | 34 | < 0.1% |
| 10000 | 32 | < 0.1% |
| 300000 | 30 | < 0.1% |
| Other values (2185) | 3285 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 221816 | 98.4% |
| 1 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 30000000 | 1 | < 0.1% |
| 26888200 | 1 | < 0.1% |
| 25000000 | 1 | < 0.1% |
| 19800000 | 1 | < 0.1% |
| 18691002 | 1 | < 0.1% |
| 13607882 | 1 | < 0.1% |
| 12626000 | 1 | < 0.1% |
| 12511990 | 1 | < 0.1% |
| 12014300 | 1 | < 0.1% |
| 11900000 | 1 | < 0.1% |
| Distinct | 2519 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7371.103569 |
| Minimum | 0 |
|---|---|
| Maximum | 30000000 |
| Zeros | 221846 |
| Zeros (%) | 98.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 30000000 |
| Range | 30000000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 185470.3088 |
|---|---|
| Coefficient of variation (CV) | 25.16181017 |
| Kurtosis | 8521.088802 |
| Mean | 7371.103569 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 74.71985798 |
| Sum | 1662132257 |
| Variance | 3.439923543 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 221846 | 98.4% |
| 50000 | 58 | < 0.1% |
| 100000 | 46 | < 0.1% |
| 200000 | 36 | < 0.1% |
| 300000 | 29 | < 0.1% |
| 40000 | 29 | < 0.1% |
| 30000 | 26 | < 0.1% |
| 500000 | 25 | < 0.1% |
| 150000 | 23 | < 0.1% |
| 400000 | 21 | < 0.1% |
| Other values (2509) | 3354 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 221846 | 98.4% |
| 1 | 5 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 30000000 | 1 | < 0.1% |
| 26888200 | 1 | < 0.1% |
| 25000000 | 1 | < 0.1% |
| 19800000 | 1 | < 0.1% |
| 18691002 | 1 | < 0.1% |
| 13607882 | 1 | < 0.1% |
| 12626000 | 1 | < 0.1% |
| 12511990 | 1 | < 0.1% |
| 12014300 | 1 | < 0.1% |
| 11900000 | 1 | < 0.1% |
| Distinct | 27608 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12992.45694 |
| Minimum | 0 |
|---|---|
| Maximum | 25642806 |
| Zeros | 153544 |
| Zeros (%) | 68.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2045 |
| 95-th percentile | 26361.4 |
| Maximum | 25642806 |
| Range | 25642806 |
| Interquartile range (IQR) | 2045 |
Descriptive statistics
| Standard deviation | 149708.4301 |
|---|---|
| Coefficient of variation (CV) | 11.52271897 |
| Kurtosis | 8574.41736 |
| Mean | 12992.45694 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 71.5253121 |
| Sum | 2929708092 |
| Variance | 2.241261403 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 153544 | 68.1% |
| 1620 | 287 | 0.1% |
| 1500 | 152 | 0.1% |
| 2000 | 140 | 0.1% |
| 1600 | 139 | 0.1% |
| 2500 | 133 | 0.1% |
| 1149 | 128 | 0.1% |
| 1250 | 121 | 0.1% |
| 1700 | 109 | < 0.1% |
| 1350 | 100 | < 0.1% |
| Other values (27598) | 70640 | 31.3% |
| Value | Count | Frequency (%) |
| 0 | 153544 | 68.1% |
| 1 | 5 | < 0.1% |
| 2 | 4 | < 0.1% |
| 3 | 19 | < 0.1% |
| 4 | 15 | < 0.1% |
| 5 | 12 | < 0.1% |
| 6 | 22 | < 0.1% |
| 7 | 13 | < 0.1% |
| 8 | 13 | < 0.1% |
| 9 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 25642806 | 1 | < 0.1% |
| 20766553 | 1 | < 0.1% |
| 17408822 | 1 | < 0.1% |
| 15518546 | 1 | < 0.1% |
| 15420411 | 1 | < 0.1% |
| 15019914 | 1 | < 0.1% |
| 14599252 | 1 | < 0.1% |
| 11305579 | 1 | < 0.1% |
| 8470059 | 1 | < 0.1% |
| 7663110 | 1 | < 0.1% |
| Distinct | 1890 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 325.684478 |
| Minimum | 0 |
|---|---|
| Maximum | 4170901 |
| Zeros | 223313 |
| Zeros (%) | 99.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4170901 |
| Range | 4170901 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 15756.16957 |
|---|---|
| Coefficient of variation (CV) | 48.37863217 |
| Kurtosis | 32466.62925 |
| Mean | 325.684478 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 152.8457066 |
| Sum | 73439570 |
| Variance | 248256879.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 223313 | 99.0% |
| 2100 | 7 | < 0.1% |
| 1232 | 6 | < 0.1% |
| 1065 | 6 | < 0.1% |
| 1100 | 6 | < 0.1% |
| 5000 | 6 | < 0.1% |
| 1167 | 5 | < 0.1% |
| 833 | 5 | < 0.1% |
| 1565 | 5 | < 0.1% |
| 1834 | 5 | < 0.1% |
| Other values (1880) | 2129 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 223313 | 99.0% |
| 1 | 3 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 16 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 4170901 | 1 | < 0.1% |
| 3246710 | 1 | < 0.1% |
| 1814000 | 1 | < 0.1% |
| 1661220 | 1 | < 0.1% |
| 1589946 | 1 | < 0.1% |
| 1447600 | 1 | < 0.1% |
| 1231166 | 1 | < 0.1% |
| 1113118 | 1 | < 0.1% |
| 1020000 | 1 | < 0.1% |
| 842483 | 1 | < 0.1% |
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3866018014 |
| Minimum | 0 |
|---|---|
| Maximum | 35 |
| Zeros | 174944 |
| Zeros (%) | 77.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 35 |
| Range | 35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.9596677387 |
|---|---|
| Coefficient of variation (CV) | 2.482315745 |
| Kurtosis | 46.36151075 |
| Mean | 0.3866018014 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.786404025 |
| Sum | 87176 |
| Variance | 0.9209621688 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 174944 | 77.6% |
| 1 | 31361 | 13.9% |
| 2 | 10806 | 4.8% |
| 3 | 4375 | 1.9% |
| 4 | 1918 | 0.9% |
| 5 | 947 | 0.4% |
| 6 | 473 | 0.2% |
| 7 | 293 | 0.1% |
| 8 | 143 | 0.1% |
| 9 | 78 | < 0.1% |
| Other values (16) | 155 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 174944 | 77.6% |
| 1 | 31361 | 13.9% |
| 2 | 10806 | 4.8% |
| 3 | 4375 | 1.9% |
| 4 | 1918 | 0.9% |
| 5 | 947 | 0.4% |
| 6 | 473 | 0.2% |
| 7 | 293 | 0.1% |
| 8 | 143 | 0.1% |
| 9 | 78 | < 0.1% |
| Value | Count | Frequency (%) |
| 35 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 23 | 2 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 2 | < 0.1% |
| 17 | 5 | < 0.1% |
| 16 | 6 | < 0.1% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09870816389 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 207647 |
| Zeros (%) | 92.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3863763545 |
|---|---|
| Coefficient of variation (CV) | 3.914330277 |
| Kurtosis | 98.8217849 |
| Mean | 0.09870816389 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.620306255 |
| Sum | 22258 |
| Variance | 0.1492866873 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 207647 | 92.1% |
| 1 | 14680 | 6.5% |
| 2 | 2405 | 1.1% |
| 3 | 519 | 0.2% |
| 4 | 136 | 0.1% |
| 5 | 56 | < 0.1% |
| 6 | 20 | < 0.1% |
| 7 | 12 | < 0.1% |
| 8 | 7 | < 0.1% |
| 12 | 3 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 207647 | 92.1% |
| 1 | 14680 | 6.5% |
| 2 | 2405 | 1.1% |
| 3 | 519 | 0.2% |
| 4 | 136 | 0.1% |
| 5 | 56 | < 0.1% |
| 6 | 20 | < 0.1% |
| 7 | 12 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 12 | 3 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 7 | < 0.1% |
| 7 | 12 | < 0.1% |
| 6 | 20 | < 0.1% |
| 5 | 56 | < 0.1% |
| 4 | 136 | 0.1% |
| Distinct | 192 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 0yrs 0mon | 114135 |
|---|---|
| 0yrs 6mon | 5907 |
| 0yrs 7mon | 5254 |
| 0yrs 11mon | 5110 |
| 0yrs 10mon | 5005 |
| Other values (187) | 90082 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.07802016 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2047030 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 21 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0yrs 0mon |
|---|---|
| 2nd row | 1yrs 11mon |
| 3rd row | 0yrs 0mon |
| 4th row | 0yrs 8mon |
| 5th row | 0yrs 0mon |
Common Values
| Value | Count | Frequency (%) |
| 0yrs 0mon | 114135 | 50.6% |
| 0yrs 6mon | 5907 | 2.6% |
| 0yrs 7mon | 5254 | 2.3% |
| 0yrs 11mon | 5110 | 2.3% |
| 0yrs 10mon | 5005 | 2.2% |
| 0yrs 9mon | 4895 | 2.2% |
| 1yrs 0mon | 4890 | 2.2% |
| 0yrs 8mon | 4785 | 2.1% |
| 1yrs 1mon | 4363 | 1.9% |
| 0yrs 5mon | 4271 | 1.9% |
| Other values (182) | 66878 | 29.7% |
Length
| Value | Count | Frequency (%) |
| 0yrs | 162056 | 35.9% |
| 0mon | 122525 | 27.2% |
| 1yrs | 35860 | 8.0% |
| 2yrs | 14563 | 3.2% |
| 6mon | 10866 | 2.4% |
| 1mon | 9887 | 2.2% |
| 7mon | 9685 | 2.1% |
| 4mon | 9571 | 2.1% |
| 3mon | 9508 | 2.1% |
| 2mon | 9464 | 2.1% |
| Other values (24) | 57001 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 293340 | 14.3% |
| r | 225493 | 11.0% |
| s | 225493 | 11.0% |
| (space) | 225493 | 11.0% |
| m | 225493 | 11.0% |
| o | 225493 | 11.0% |
| n | 225493 | 11.0% |
| y | 225493 | 11.0% |
| 1 | 72046 | 3.5% |
| 2 | 24086 | 1.2% |
| Other values (7) | 79107 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1352958 | 66.1% |
| Decimal Number | 468579 | 22.9% |
| Space Separator | 225493 | 11.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 293340 | 62.6% |
| 1 | 72046 | 15.4% |
| 2 | 24086 | 5.1% |
| 3 | 16041 | 3.4% |
| 4 | 12576 | 2.7% |
| 6 | 11659 | 2.5% |
| 5 | 10911 | 2.3% |
| 7 | 10142 | 2.2% |
| 8 | 8993 | 1.9% |
| 9 | 8785 | 1.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 225493 | 16.7% |
| s | 225493 | 16.7% |
| m | 225493 | 16.7% |
| o | 225493 | 16.7% |
| n | 225493 | 16.7% |
| y | 225493 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 225493 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1352958 | 66.1% |
| Common | 694072 | 33.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 293340 | 42.3% |
| (space) | 225493 | 32.5% |
| 1 | 72046 | 10.4% |
| 2 | 24086 | 3.5% |
| 3 | 16041 | 2.3% |
| 4 | 12576 | 1.8% |
| 6 | 11659 | 1.7% |
| 5 | 10911 | 1.6% |
| 7 | 10142 | 1.5% |
| 8 | 8993 | 1.3% |
Latin
| Value | Count | Frequency (%) |
| r | 225493 | 16.7% |
| s | 225493 | 16.7% |
| m | 225493 | 16.7% |
| o | 225493 | 16.7% |
| n | 225493 | 16.7% |
| y | 225493 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2047030 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 293340 | 14.3% |
| r | 225493 | 11.0% |
| s | 225493 | 11.0% |
| (space) | 225493 | 11.0% |
| m | 225493 | 11.0% |
| o | 225493 | 11.0% |
| n | 225493 | 11.0% |
| y | 225493 | 11.0% |
| 1 | 72046 | 3.5% |
| 2 | 24086 | 1.2% |
| Other values (7) | 79107 | 3.9% |
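The values in this variable follow a fixed `<years>yrs <months>mon` pattern (for example `0yrs 0mon` or `1yrs 11mon`), which is why it is profiled as text with high cardinality. A minimal sketch of one way to turn such strings into a single numeric month count is shown below. The helper name `duration_to_months` and the regular expression are assumptions for illustration, inferred only from the sample values shown in this report.

```python
import re

# Hypothetical pattern, inferred from sample values such as "1yrs 11mon".
_DURATION = re.compile(r"^(\d+)yrs (\d+)mon$")


def duration_to_months(text: str) -> int:
    """Convert a string like '1yrs 11mon' into a total number of months."""
    match = _DURATION.match(text.strip())
    if match is None:
        # Fail loudly on anything that does not fit the assumed pattern.
        raise ValueError(f"unexpected duration format: {text!r}")
    years, months = (int(part) for part in match.groups())
    return years * 12 + months


# Sample rows taken from the report above.
samples = ["0yrs 0mon", "1yrs 11mon", "0yrs 8mon"]
print([duration_to_months(s) for s in samples])
```

Applied to a whole column, a conversion like this would let the field be profiled as a numeric variable rather than as free text.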
| Distinct | 291 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
| 0yrs 0mon | 113894 |
|---|---|
| 0yrs 6mon | 4670 |
| 2yrs 1mon | 4596 |
| 0yrs 7mon | 3952 |
| 2yrs 0mon | 3711 |
| Other values (286) | 94670 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.092490676 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2050293 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 39 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0yrs 0mon |
|---|---|
| 2nd row | 1yrs 11mon |
| 3rd row | 0yrs 0mon |
| 4th row | 1yrs 3mon |
| 5th row | 0yrs 0mon |
Common Values
| Value | Count | Frequency (%) |
| 0yrs 0mon | 113894 | 50.5% |
| 0yrs 6mon | 4670 | 2.1% |
| 2yrs 1mon | 4596 | 2.0% |
| 0yrs 7mon | 3952 | 1.8% |
| 2yrs 0mon | 3711 | 1.6% |
| 1yrs 0mon | 3290 | 1.5% |
| 1yrs 1mon | 2960 | 1.3% |
| 0yrs 11mon | 2564 | 1.1% |
| 0yrs 8mon | 2401 | 1.1% |
| 0yrs 9mon | 2351 | 1.0% |
| Other values (281) | 81104 | 36.0% |
Length
| Value | Count | Frequency (%) |
| 0yrs | 141964 | 31.5% |
| 0mon | 125313 | 27.8% |
| 1yrs | 26011 | 5.8% |
| 2yrs | 22097 | 4.9% |
| 1mon | 13351 | 3.0% |
| 3yrs | 11669 | 2.6% |
| 6mon | 11021 | 2.4% |
| 7mon | 10010 | 2.2% |
| 2mon | 9140 | 2.0% |
| 11mon | 8800 | 2.0% |
| Other values (37) | 71610 | 15.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 276257 | 13.5% |
| r | 225493 | 11.0% |
| s | 225493 | 11.0% |
| (space) | 225493 | 11.0% |
| m | 225493 | 11.0% |
| o | 225493 | 11.0% |
| n | 225493 | 11.0% |
| y | 225493 | 11.0% |
| 1 | 70096 | 3.4% |
| 2 | 32046 | 1.6% |
| Other values (7) | 93443 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1352958 | 66.0% |
| Decimal Number | 471842 | 23.0% |
| Space Separator | 225493 | 11.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 276257 | 58.5% |
| 1 | 70096 | 14.9% |
| 2 | 32046 | 6.8% |
| 3 | 20739 | 4.4% |
| 4 | 15656 | 3.3% |
| 6 | 13961 | 3.0% |
| 5 | 12996 | 2.8% |
| 7 | 11955 | 2.5% |
| 8 | 9215 | 2.0% |
| 9 | 8921 | 1.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 225493 | 16.7% |
| s | 225493 | 16.7% |
| m | 225493 | 16.7% |
| o | 225493 | 16.7% |
| n | 225493 | 16.7% |
| y | 225493 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 225493 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1352958 | 66.0% |
| Common | 697335 | 34.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 276257 | 39.6% |
| (space) | 225493 | 32.3% |
| 1 | 70096 | 10.1% |
| 2 | 32046 | 4.6% |
| 3 | 20739 | 3.0% |
| 4 | 15656 | 2.2% |
| 6 | 13961 | 2.0% |
| 5 | 12996 | 1.9% |
| 7 | 11955 | 1.7% |
| 8 | 9215 | 1.3% |
Latin
| Value | Count | Frequency (%) |
| r | 225493 | 16.7% |
| s | 225493 | 16.7% |
| m | 225493 | 16.7% |
| o | 225493 | 16.7% |
| n | 225493 | 16.7% |
| y | 225493 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2050293 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 276257 | 13.5% |
| r | 225493 | 11.0% |
| s | 225493 | 11.0% |
| (space) | 225493 | 11.0% |
| m | 225493 | 11.0% |
| o | 225493 | 11.0% |
| n | 225493 | 11.0% |
| y | 225493 | 11.0% |
| 1 | 70096 | 3.4% |
| 2 | 32046 | 1.6% |
| Other values (7) | 93443 | 4.6% |
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2088446205 |
| Minimum | 0 |
|---|---|
| Maximum | 36 |
| Zeros | 194990 |
| Zeros (%) | 86.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7100854515 |
|---|---|
| Coefficient of variation (CV) | 3.4000658 |
| Kurtosis | 132.0497536 |
| Mean | 0.2088446205 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.862951512 |
| Sum | 47093 |
| Variance | 0.5042213484 |
| Monotonicity | Not monotonic |
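The descriptive statistics above are linked by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The short check below recomputes them from the reported figures; the variable names are illustrative and are not part of the report.

```python
from math import isclose

# Figures copied from the report tables for this variable.
n_observations = 225493
total = 47093               # "Sum"
std_dev = 0.7100854515      # "Standard deviation"
zeros = 194990              # "Zeros"

mean = total / n_observations          # reported: 0.2088446205
cv = std_dev / mean                    # reported: 3.4000658
variance = std_dev ** 2                # reported: 0.5042213484
zeros_pct = 100 * zeros / n_observations  # reported: 86.5%

# Compare against the reported values with a loose relative tolerance,
# since the report rounds to roughly ten significant digits.
print(isclose(mean, 0.2088446205, rel_tol=1e-6))
print(isclose(cv, 3.4000658, rel_tol=1e-6))
print(isclose(variance, 0.5042213484, rel_tol=1e-6))
print(round(zeros_pct, 1))
```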
| Value | Count | Frequency (%) |
| 0 | 194990 | 86.5% |
| 1 | 21794 | 9.7% |
| 2 | 5294 | 2.3% |
| 3 | 1724 | 0.8% |
| 4 | 745 | 0.3% |
| 5 | 331 | 0.1% |
| 6 | 234 | 0.1% |
| 7 | 133 | 0.1% |
| 8 | 103 | < 0.1% |
| 9 | 41 | < 0.1% |
| Other values (15) | 104 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 194990 | 86.5% |
| 1 | 21794 | 9.7% |
| 2 | 5294 | 2.3% |
| 3 | 1724 | 0.8% |
| 4 | 745 | 0.3% |
| 5 | 331 | 0.1% |
| 6 | 234 | 0.1% |
| 7 | 133 | 0.1% |
| 8 | 103 | < 0.1% |
| 9 | 41 | < 0.1% |
| Value | Count | Frequency (%) |
| 36 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 6 | < 0.1% |
| 18 | 4 | < 0.1% |
| 17 | 4 | < 0.1% |
| 16 | 3 | < 0.1% |
| 15 | 7 | < 0.1% |
loan_default
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 225493 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 176526 | 78.3% |
| 1 | 48967 | 21.7% |
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 176526 | 78.3% |
| 1 | 48967 | 21.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 176526 | 78.3% |
| 1 | 48967 | 21.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 225493 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 176526 | 78.3% |
| 1 | 48967 | 21.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 225493 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 176526 | 78.3% |
| 1 | 48967 | 21.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 225493 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 176526 | 78.3% |
| 1 | 48967 | 21.7% |
Auto
The auto setting is an easily interpretable pairwise column metric that picks a method from the types of the two columns: categorical-categorical uses Cramér's V, numerical-categorical uses Cramér's V (on a discretized numerical column), and numerical-numerical uses Spearman's ρ. This configuration uses the most suitable method for each pair of columns.
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
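The calculation described above can be sketched in plain Python: rank both variables (tied values share their average rank), then divide the covariance of the ranks by the product of their standard deviations. The helper names `average_ranks` and `spearman_rho` are illustrative and are not the report tool's API; the sample values are the `disbursed_amount` and `asset_cost` entries of the first five rows shown in this report.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Covariance of the rank variables divided by the product of their std devs."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_rx, mean_ry = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_rx) * (b - mean_ry) for a, b in zip(rx, ry)) / n
    std_rx = sqrt(sum((a - mean_rx) ** 2 for a in rx) / n)
    std_ry = sqrt(sum((b - mean_ry) ** 2 for b in ry) / n)
    return cov / (std_rx * std_ry)


# First five rows of the sample table (far too few rows for a real estimate).
disbursed_amount = [50578, 47145, 53278, 57513, 52378]
asset_cost = [58400, 65550, 61360, 66113, 60300]
print(spearman_rho(disbursed_amount, asset_cost))

# A nonlinear but strictly increasing relationship still gives rho close to 1.
cubes = [v ** 3 for v in range(1, 6)]
print(spearman_rho(list(range(1, 6)), cubes))
```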
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
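As a minimal sketch of that formula, the function below divides the covariance by the product of the standard deviations and then illustrates the invariance under changes of location and scale. The name `pearson_r` is illustrative; the sample values are the `disbursed_amount` and `asset_cost` entries of the first five rows shown in this report.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


# First five rows of the sample table (illustration only).
disbursed_amount = [50578, 47145, 53278, 57513, 52378]
asset_cost = [58400, 65550, 61360, 66113, 60300]

r_sample = pearson_r(disbursed_amount, asset_cost)

# Shift and rescale each variable separately (positive scale factors):
# r should be unchanged up to floating-point error.
r_rescaled = pearson_r(
    [2 * v + 100 for v in disbursed_amount],
    [0.5 * v - 3 for v in asset_cost],
)
print(r_sample, r_rescaled)
```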
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
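The pair-counting definition above corresponds to the variant usually called τ-a. The sketch below implements exactly that definition and counts tied pairs as neither concordant nor discordant. Statistical libraries often report a tie-adjusted variant instead, so their results can differ when ties are present. The function name `kendall_tau_a` and the toy inputs are illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, as described above."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The product is positive when both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y: counted as neither.
    return (concordant - discordant) / len(pairs)


# Toy data: 2 concordant pairs and 1 discordant pair out of 3.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

The double loop is quadratic in the number of observations, so for a dataset of this size a library implementation would be the practical choice.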
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | df_index | UniqueID | disbursed_amount | asset_cost | ltv | branch_id | supplier_id | manufacturer_id | Current_pincode_ID | Date.of.Birth | Employment.Type | DisbursalDate | State_ID | Employee_code_ID | Aadhar_flag | PAN_flag | VoterID_flag | Driving_flag | Passport_flag | PERFORM_CNS.SCORE | PERFORM_CNS.SCORE.DESCRIPTION | PRI.NO.OF.ACCTS | PRI.ACTIVE.ACCTS | PRI.OVERDUE.ACCTS | PRI.CURRENT.BALANCE | PRI.SANCTIONED.AMOUNT | PRI.DISBURSED.AMOUNT | SEC.NO.OF.ACCTS | SEC.ACTIVE.ACCTS | SEC.OVERDUE.ACCTS | SEC.CURRENT.BALANCE | SEC.SANCTIONED.AMOUNT | SEC.DISBURSED.AMOUNT | PRIMARY.INSTAL.AMT | SEC.INSTAL.AMT | NEW.ACCTS.IN.LAST.SIX.MONTHS | DELINQUENT.ACCTS.IN.LAST.SIX.MONTHS | AVERAGE.ACCT.AGE | CREDIT.HISTORY.LENGTH | NO.OF_INQUIRIES | loan_default |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 420825 | 50578 | 58400 | 89.55 | 67 | 22807 | 45 | 1441 | 1984-01-01 | Salaried | 2018-03-08 | 6 | 1998 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 1 | 1 | 537409 | 47145 | 65550 | 73.23 | 67 | 22807 | 45 | 1502 | 1985-07-31 | Self employed | 2018-09-26 | 6 | 1998 | True | False | False | False | False | 598 | I-Medium Risk | 1 | 1 | 1 | 27600 | 50200 | 50200 | 0 | 0 | 0 | 0 | 0 | 0 | 1991 | 0 | 0 | 1 | 1yrs 11mon | 1yrs 11mon | 0 | 1 |
| 2 | 2 | 417566 | 53278 | 61360 | 89.63 | 67 | 22807 | 45 | 1497 | 1985-08-24 | Self employed | 2018-01-08 | 6 | 1998 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 3 | 3 | 624493 | 57513 | 66113 | 88.48 | 67 | 22807 | 45 | 1501 | 1993-12-30 | Self employed | 2018-10-26 | 6 | 1998 | True | False | False | False | False | 305 | L-Very High Risk | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 | 0 | 0 | 0 | 0yrs 8mon | 1yrs 3mon | 1 | 1 |
| 4 | 4 | 539055 | 52378 | 60300 | 88.39 | 67 | 22807 | 45 | 1495 | 1977-09-12 | Self employed | 2018-09-26 | 6 | 1998 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 1 | 1 |
| 5 | 5 | 518279 | 54513 | 61900 | 89.66 | 67 | 22807 | 45 | 1501 | 1990-08-09 | Self employed | 2018-09-19 | 6 | 1998 | True | False | False | False | False | 825 | A-Very Low Risk | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1347 | 0 | 0 | 0 | 1yrs 9mon | 2yrs 0mon | 0 | 0 |
| 6 | 6 | 529269 | 46349 | 61500 | 76.42 | 67 | 22807 | 45 | 1502 | 1988-01-06 | Salaried | 2018-09-23 | 6 | 1998 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 7 | 7 | 510278 | 43894 | 61900 | 71.89 | 67 | 22807 | 45 | 1501 | 1989-04-10 | Salaried | 2018-09-16 | 6 | 1998 | True | False | False | False | False | 17 | Not Scored: Not Enough Info available on the customer | 1 | 1 | 0 | 72879 | 74500 | 74500 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 2mon | 0yrs 2mon | 0 | 0 |
| 8 | 8 | 490213 | 53713 | 61973 | 89.56 | 67 | 22807 | 45 | 1497 | 1991-11-15 | Self employed | 2018-05-09 | 6 | 1998 | True | False | False | False | False | 718 | D-Very Low Risk | 1 | 1 | 0 | -41 | 365384 | 365384 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4yrs 8mon | 4yrs 8mon | 1 | 0 |
| 9 | 9 | 510980 | 52603 | 61300 | 86.95 | 67 | 22807 | 45 | 1492 | 2068-01-06 | Salaried | 2018-09-16 | 6 | 1998 | False | False | True | False | False | 818 | A-Very Low Risk | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2608 | 0 | 0 | 0 | 1yrs 7mon | 1yrs 7mon | 0 | 0 |
Last rows
| | df_index | UniqueID | disbursed_amount | asset_cost | ltv | branch_id | supplier_id | manufacturer_id | Current_pincode_ID | Date.of.Birth | Employment.Type | DisbursalDate | State_ID | Employee_code_ID | Aadhar_flag | PAN_flag | VoterID_flag | Driving_flag | Passport_flag | PERFORM_CNS.SCORE | PERFORM_CNS.SCORE.DESCRIPTION | PRI.NO.OF.ACCTS | PRI.ACTIVE.ACCTS | PRI.OVERDUE.ACCTS | PRI.CURRENT.BALANCE | PRI.SANCTIONED.AMOUNT | PRI.DISBURSED.AMOUNT | SEC.NO.OF.ACCTS | SEC.ACTIVE.ACCTS | SEC.OVERDUE.ACCTS | SEC.CURRENT.BALANCE | SEC.SANCTIONED.AMOUNT | SEC.DISBURSED.AMOUNT | PRIMARY.INSTAL.AMT | SEC.INSTAL.AMT | NEW.ACCTS.IN.LAST.SIX.MONTHS | DELINQUENT.ACCTS.IN.LAST.SIX.MONTHS | AVERAGE.ACCT.AGE | CREDIT.HISTORY.LENGTH | NO.OF_INQUIRIES | loan_default |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 225483 | 233144 | 613161 | 56059 | 69001 | 83.04 | 34 | 23024 | 86 | 1044 | 2063-06-15 | Salaried | 2018-10-24 | 6 | 3705 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 225484 | 233145 | 606146 | 49803 | 66973 | 76.15 | 34 | 21081 | 45 | 1051 | 1985-12-23 | Self employed | 2018-10-23 | 6 | 3705 | True | False | False | False | False | 690 | E-Low Risk | 7 | 4 | 0 | 13064 | 85629 | 80226 | 0 | 0 | 0 | 0 | 0 | 0 | 1672 | 0 | 2 | 0 | 0yrs 9mon | 2yrs 6mon | 1 | 0 |
| 225485 | 233146 | 622612 | 38439 | 52965 | 74.58 | 34 | 20700 | 48 | 1051 | 1982-07-23 | Self employed | 2018-10-26 | 6 | 3705 | True | False | False | False | False | 738 | C-Very Low Risk | 2 | 2 | 0 | 7001 | 14839 | 14839 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0yrs 3mon | 0yrs 3mon | 0 | 0 |
| 225486 | 233147 | 645697 | 72623 | 105405 | 69.73 | 34 | 20700 | 48 | 1051 | 1989-06-19 | Salaried | 2018-10-31 | 6 | 3705 | True | False | False | False | False | 755 | C-Very Low Risk | 4 | 4 | 0 | 201422 | 276624 | 237977 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0yrs 9mon | 1yrs 0mon | 0 | 0 |
| 225487 | 233148 | 613494 | 42894 | 60334 | 72.93 | 34 | 20700 | 48 | 1051 | 1993-08-07 | Salaried | 2018-10-24 | 6 | 3705 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 225488 | 233149 | 626432 | 63213 | 105405 | 60.72 | 34 | 20700 | 48 | 1050 | 1988-01-08 | Salaried | 2018-10-26 | 6 | 3705 | False | False | True | False | False | 735 | D-Very Low Risk | 4 | 3 | 0 | 390443 | 416133 | 416133 | 0 | 0 | 0 | 0 | 0 | 0 | 4084 | 0 | 0 | 0 | 1yrs 9mon | 3yrs 3mon | 0 | 0 |
| 225489 | 233150 | 606141 | 73651 | 100600 | 74.95 | 34 | 23775 | 51 | 990 | 1988-05-12 | Self employed | 2018-10-23 | 6 | 3705 | False | False | True | False | False | 825 | A-Very Low Risk | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1565 | 0 | 0 | 0 | 0yrs 6mon | 0yrs 6mon | 0 | 0 |
| 225490 | 233151 | 613658 | 33484 | 71212 | 48.45 | 77 | 22186 | 86 | 2299 | 1976-01-06 | Salaried | 2018-10-24 | 4 | 3479 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 225491 | 233152 | 548084 | 34259 | 73286 | 49.10 | 77 | 22186 | 86 | 2299 | 1994-03-26 | Salaried | 2018-09-29 | 4 | 3479 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |
| 225492 | 233153 | 630213 | 75751 | 116009 | 66.81 | 77 | 22186 | 86 | 2299 | 1984-02-18 | Salaried | 2018-10-27 | 4 | 3479 | True | False | False | False | False | 0 | No Bureau History Available | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0yrs 0mon | 0yrs 0mon | 0 | 0 |